# Document OCR Enhancement

PE Lang G14 448

The Perception Encoder (PE) is a state-of-the-art encoder for image and video understanding. It is trained with vision-language learning and generalizes well across tasks.

Eagle is a series of vision-centric, high-resolution multimodal large language models. The models support input resolutions of 1K and above and excel at tasks such as optical character recognition and document understanding.

© 2025 AIbase